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starER: a conceptual model for data warehouse design 
Nectaria Tryfona, Frank Busborg, Jens G. Borch Christiansen 
November 1999 Proceedings of the 2nd ACM international workshop on Data 
warehousing and OLAP 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: f?| pdf(742.63 KB) 



Modeling data warehouses is a complex task focusing, very often, into internal structures 
and implementation issues. In this paper we argue that, in order to accurately reflect the 
users. requirements into an error-free, understandable, and easily extendable data 
warehouse schema, special attention should be paid at the conceptual modeling phase. 
Based on a real mortgage business warehouse environment, we present a set of user 
modeling requirements and we discuss the involved concepts. Under ... 

Keywords: ER model, conceptual modeling, data warehouse, star structure 



A tightly coupled approach to design and data management 
Flavio R. Wagner, Lia G. Golendziner, Miguel R. Fornari 

September 1994 Proceedings of the conference on European design automation 

Full text available: W[ pdf(794.13 KB) Additional Information: full citation , references , citings , index terms 



3 A comparison of data warehousing methodologies 
Arun Sen, Atish P. Sinha 

March 2005 Communications of the ACM, Volume 48 issue 3 

Full text available: ■ pipdff 11 7. 81 KB) ...... M _ + . . n .. . . . _ . . . 

,W [\ ' „ AA ' Additional Information: full citation , abstract , references , index terms 
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Using a common set of attributes to determine which methodology to use in a particular 
data warehousing project. 

4 Designing data marts for data warehouses 

October 2001 ACM Transactions on Software Engineering and Methodology (TOSEM) ; 

Volume 10 Issue 4 

Additional Information: full citation , abstract , references , index terms . 



Full text available: m pdf(203.43 KB) 
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Data warehouses are databases devoted to analytical processing. They are used to support 
decision-making activities in most modern business settings, when complex data sets have 
to be studied and analyzed. The technology for analytical processing assumes that data are 
presented in the form of simple data marts, consisting of a well-identified collection of facts 
and data analysis dimensions (star schema). Despite the wide diffusion of data warehouse 
technology and concepts, we still miss me ... 

Keywords: conceptual modeling, data mart, data warehouse, design method, software 
quality management 



Dynamic maintenance of multidimensional range data partitioning for parallel data 
processing 

Junping Sun, William I. Grosky 

November 1998 Proceedings of the 1st ACM international workshop on Data 
warehousing and OLAP 

Full text available: *P| pdf(1.09 MB) Additional Information: full citation , references , index terms 



Deriving initial data warehouse structures from the conceptual data models of the 

underlying operational information systems 
Michael Boehnlein, Achim Ulbrich-vom Ende 

November 1999 Proceedings of the 2nd ACM international workshop on Data 

warehousing and OLAP 

_ ii , , mu.su J(M ,n»/.m Additional Information: full citation , abstract , references , citings , index 

Full text available: "pn pdf(1.40 MB) 

fc^a terms 

In recent years the construction of large scale data schemes for operational systems has 
been the major problem of conceptual data modeling for business needs. Multidimensional 
data structures used for decision support applications in data warehouses have rather 
different requirements to data modeling techniques. In case of operational systems the data 
models are created from application specific requirements. The data models in data * 
warehouses base on the analytical requirements of the use ... 

Keywords: conceptual data model, data warehouse, decision support system, entity 
relationship model (ERM), snowflake scheme, star scheme, structured entity relationship 
model (SERM) 



A strategy for the integration of object-oriented data modeling into the undergraduate 

database course 
Bruce J. Neubauer 

March 2001 Journal of Computing Sciences in Colleges, volume 16 issue 3 

r- .. * a. -i u. tfsn mo-, oo i/ D \ Additional Information; full citation , abstract , references , citings , index 

Full text available: "pi pdf(67.38 KB) - — : 

L - h ^ terms 

Computer programming and business-systems design are increasingly embracing the object 
paradigm. It is important that students learn both traditional methods and object methods 
in the undergraduate database course. Object-oriented concepts such as generalization 
hierarchies and aggregation require a substantially different mindset than does the 
traditional course content. Most database textbooks are attempting to integrate the 
presentation of the traditional and the object-oriented concepts. ... 

Event-entity-relationship modeling in data warehouse environments 
Lars Baekgaard 
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November 1999 Proceedings of the 2nd ACM international workshop on Data 
warehousing and OLAP 

Full text available- fP I pdf(634 98 KB) Additional Information: full citation , abstract , references , citings , index 
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We use the event-entity-relationship model (EVER) to illustrate the use of entity-based 
modeling languages for conceptual schema design in data warehouse environments. EVER 
is a general-purpose information modeling language that supports the specification of both 
general schema structures and multi-dimensional schemes that are customized to serve 
specific information needs. EVER is based on an event concept that is very well suited for 
multi-dimensional modeling because measurement data oft ... 

Keywords: data warehousing, event modeling, information modeling, multi-dimensional 
modeling, star schemes 



9 Physical design: Physical modeling of data warehouses using UML 
Sergio Lujcin-Mora, Juan Trujillo 

November 2004 Proceedings of the 7th ACM international workshop on Data 
warehousing and OLAP 

Full text available: ^ pdf(640.48 KB) Additional Information: full citation , abstract , references , index terms 

During the few last years, several approaches have been proposed to model different 
aspects of a Data Warehouse (DW), such as the conceptual model of the DW, the design of 
the ETL (Extraction, Transformation, Loading) processes, the derivation of the DW models 
from the enterprise data models, etc. At the end, a DW has to be deployed to a database 
environment and that takes many decisions of a physical nature. However, few efforts have 
been dedicated to the modeling of the physical design (i.e ... 

Keywords: UML, component, configuration, data warehouse, deployment, physical design 




10 Industrial sessions: commercial implementation techniques: Efficient execution of joins 
in a star schema 

Andreas Weininger 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 
Management of data 

Full text available: " H pdf(349.43 KB) Additional Information: full citation , abstract , references , index terms 

A star schema is very popular for modeling data warehouses and data marts. Therefore, it 
is important that a database system which is used for implementing such a data warehouse 
or data mart is able to efficiently handle operations on such a schema. In this paper we will 
describe how one of these operations, the join operation — probably the most important 
operation — is implemented in the IBM Informix Extended Parallel Server (XPS). 

11 An overview of data warehousing and OLAP technology 
Surajit Chaudhuri, Umeshwar Dayal 

March 1997 ACM SIGMOD Record, volume 26 issue l 

Full text available: ^| pdf(101.60 KB) Additional Information: full citation , abstract , citings , index terms 

Data warehousing and on-line analytical processing (OLAP) are essential elements of 
decision support, which has increasingly become a focus of the database industry. Many 
commercial products and services are now available, and all of the principal database 
management system vendors now have offerings in these areas. Decision support places 
some rather different requirements on database technology compared to traditional on-line 
transaction processing applications. This paper provides an overview ... 
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12 A horizontal fragmentation algorithm for the fact relation in a distributed data 
warehouse 

Amin Y. Noaman, Ken Barker 

November 1999 Proceedings of the eighth international conference on Information and 
knowledge management 

Full text available- W\ pdf(986 60 KB) Additional Information: full citation, abstract, references , citings, index 

: terms 

Data warehousing is one of the major research topics of appliedside database investigators. 
Most of the work to date has focused on building large centralized systems that are 
integrated repositories founded on pre-existing systems upon which all corporate-wide data 
are based. Unfortunately, this approach is very expensive and tends to ignore the 
advantages realized during the past decade in the area of distribution and support for data 
localization in a geographically dispersed corporate s ... 

Keywords: distributed data warehouse architecture, distributed data warehouse design, 
horizontal fragmentation 



13 Industrial sessions: big data: TPC-DS, taking decision support benchmarking to the 
next level 

Meikel Poess, Bryan Smith, Lubor Kollar, Paul Larson 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 
Management of data 

Full text available- W\ df(645 38 KB) Additional Information: full citation , abstract , references , citings , index 
' ! terms 

TPC-DS is a new decision support benchmark currently under development by the 
Transaction Processing Performance Council (TPC). This paper provides a brief overview of 
the new benchmark. The benchmark models the decision support functions of a retail 
product supplier, including data loading, multiple types of queries and data maintenance. 
The database consists of multiple snowflake schemas with shared dimension tables; data is 
skewed; and the query set is large. Overall, the benchmark is conside ... 

Keywords: TPC, benchmark, data warehouse, decision support, performance evaluation 



14 Modelling stars using XML |j§ 
Jaroslav Pokorny 

November 2001 Proceedings of the 4th ACM international workshop on Data 
warehousing and OLAP 

Full text available: ^ pdf(2.29 MB) Additional Information: full citation , abstract , references , index terms 

We suppose collections of XML data described by Document Type Definitions (DTDs). This 
data has been generated by applications and plays a role of OLTP database(s). A star 
schema, a well-known technique used in data warehousing, can be applied. Then dimension 
information is supposed to be contained in XML data. We will use the notions of subDTD 
and view, and formulate referential integrity constraints in XML environment. We use 
simple pattern matching capabilities of current XML query languages ... 

Keywords: XML, data warehouse, dimension, star schema 



15 Articles: Reconsidering Multi-Dimensional schemas 
Tim Martyn 

March 2004 ACM SIGMOD Record, volume 33 issue l 
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Full text available: ^ pdfd 63.67 KB) Additional Information: full citation , abstract , references , citings 

This paper challenges the currently popular "Data Warehouse is a Special Animal" 
philosophy and advocates that practitioners adopt a more conservative "Data 
Warehouse=Database" philosophy. The primary focus is the relevancy of Multi-Dimensional 
logical schemas. After enumerating the advantages of such schemas, a number of caveats 
to the presumed advantages are identified. The paper concludes with guidelines and 
commentary on implications for data warehouse design methodologies. 

16 Physical design: Handling big dimensions in distributed data warehouses using the 

DWS technique 

Marco Costa, Henrique Madeira 

November 2004 Proceedings of the 7th ACM international workshop on Data 
warehousing and OLAP 

Full text available: ^ pdf(288.33 KB) Additional Information: full citation , abstract , references , index terms 

The DWS (Data Warehouse Striping) technique allows the distribution of large data 
warehouses through a cluster of computers. The data partitioning approach partition the 
facts tables through all nodes and replicates the dimension tables. The replication of the 
dimension tables creates a limitation to the applicability of the DWS technique to data 
warehouses with big dimensions. This paper proposes a strategy to handle large dimensions 
in a distributed DWS system and evaluates the proposed str ... 

Keywords: data warehousing, distributed query execution 



17 Reports: Report on the ACM fourth international workshop on data warehousing and |j| 

OLAP (DOLAP 2001) 
Joachim Hammer 

June 2002 ACM SIGMOD Record, Volume 31 issue 2 

Full text available: g pdf(413.34 KB) Additional Information: full citation , abstract 

The Fourth Annual ACM International Workshop on Data^Warehousing and Online Analytical 
Processing (DOLAP 2001) was held in Atlanta, GA, USA, in November 2001, in conjunction 
with the Tenth International Conference on Information and Knowledge Management (CIKM 
2001). Although this was only the fourth annual meeting, DOLAP has already become an 
important and broadly accepted forum for researchers and practitioners to share their 
findings in theoretical foundations, current methodologies, practical ... 

18 Workshop reports: Report on the ACM fourth international workshop on data 

warehousing and OLAP (DOLAP 2001) 
Joachim Hammer 

April 2002 ACM SIGIR Forum, volume 36 issue l 
Full text available: pdfd 05. 96 KB) Additional Information: full citation 




i9 On relationships offering new drill-across possibilities 
Alberto Abello, Jose Samos, Felix Saltor 

November 2002 Proceedings of the 5th ACM international workshop on Data 
Warehousing and OLAP 

Full text available- W\ pdf(236 84 KB) Additional Information: full citation , abstract , references , citings , index 
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OLAP tools divide concepts based on whether they are used as analysis dimensions, or are 
the fact subject of analysis, which gives rise to star shape schemas. Operations are always 
provided to navigate inside such star schemas. However, the navigation among different 
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stars is usually overlooked. This paper studies different kinds of Object-Oriented conceptual 
relationships (part of UML standard) between stars (namely Derivation, Generalization, 
Association, and Flow) ... 

Keywords: UML, drill-across, multidimensional design, semantics 



20 XPS a database server for data warehousing 
Andreas Weininger 

November 2001 Proceedings of the 4th ACM international workshop on Data 
warehousing and OLAP 

Full text available: ^) pdf(621.63 KB) Additional Information: full citation , abstract , citings , index terms 

A database server used for implementing a data warehouse must support other features 
than a database server used for OLTP. Therefore, in this paper we will look specifically at 
features necessary for efficiently processing queries on a database with a star schema 
model, a database scheme which is used very often in data warehousing. We will especially 
analyze the features provided for this by the IBM Extended Parallel Server (XPS). There are 
special star join methods like the Push-Down Hash Semi ... 
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